3574 results found.
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
10 GByte Production Status:
Newly created-finished
Use:
Summarisation
-
Paper title:A Multi-level Annotated Corpus of Scientific Papers for Scientific Document Summarization and Cross-document Relation Discovery
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ahmed AbuRa'ed | A Multi-level Annotated Corpus of Scientific Papers for Scientific Document Summarization and Cross-document Relation Discovery | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
4M entries Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:SOLO: A Corpus of Tweets for Examining the State of Being Alone
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Svetlana Kiritchenko | SOLO Tweet Corpus | /N |
Documentation:
documentation in English
Written
Ontology,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
70 pages OtherProduction Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:The Medical Scribe: Corpus Development and Model Performance Analyses
-
Paper track:Speech/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Izhak Shafran | The Labeling Guidelines for the Scribe Tasks | /N |
Documentation:
English
Multimodal/Multimedia
Corpus,
Language Type:
Bilingual
Languages:
English Japanese
Availability:
From Owner
License:
TBA
Size:
10 hours Production Status:
Newly created-finished
Use:
Dialogue
-
Paper title:The AICO Multimodal Corpus – Data Collection and Preliminary Analyses
-
Paper track:Multimodality/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kristiina Jokinen | AICO Corpus | /N |
Documentation:
yes
Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch English French Portuguese
Availability:
Freely Available
License:
Apache-2.0
Size:
31403 translation units OtherProduction Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:A Post-Editing Dataset in the Legal Domain: Do we Underestimate Neural Machine Translation Quality?
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Julia Ive | Post-Editing Dataset in the Legal Domain | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
GNU
Size:
2540 sentences Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:Adjusting Image Attributes of Localized Regions with Low-level Dialogue
-
Paper track:Multimodality/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Tzu-Hsiang Lin | Imperative Low-level Complete Image Edit Requests | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CC-BY-SHAREALIKE
Size:
900K tokens Production Status:
Newly created-in progress
Use:
Natural Language Generation
-
Paper title:Object Naming in Language and Vision: A Survey and a New Dataset
-
Paper track:Multimodality/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Sina Zarrieß | ManyNames | /N |
Documentation:
available in English
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
66 conversation/notes OtherProduction Status:
Existing-used
Use:
Natural Language Generation
-
Paper title:Alignment Annotation for Clinic Visit Dialogue to Clinical Note Sentence Language Generation
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Wen-wai Yim | dialogue2note sentence alignments | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
7519 dialogues OtherProduction Status:
Newly created-finished
Use:
Dialogue
-
Paper title:A Corpus of Controlled Opinionated and Knowledgeable Movie Discussions for Training Neural Conversation Models
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Fabian Galetzka | KOMODIS-dataset | /N |
Documentation:
Yes in english. Yes, will be publicly available.
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
5562 entries Production Status:
Existing-updated
Use:
Machine Learning
-
Paper title:Age Suitability Rating: Predicting the MPAA Rating Based on Movie Dialogues
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mahsa Shafaei | Movie MPAA Information Corpus | /N |
Documentation:
None




